Method and data processing system for providing XML data

ABSTRACT

A method and systems for providing XML data is disclosed. In accordance with an embodiment of the invention, a second data processing system, which is connected to a first data processing system via a network, receives a first request over the network from the first data processing system. The first request comprises specifications for subsequent transfers of XML data from the second data processing system to the first data processing system. The specifications specify for which type of XML documents to be transferred in subsequent transfers to the first data processing system which excerpts of XML data shall be sent. An acknowledge message, sent to the first data processing system from the second data processing system, indicates the latter&#39;s ability to provide the excerpts of XML data for the types of XML documents in the subsequent data transfers.

FIELD OF THE INVENTION

The invention relates to a method of providing XML data to a dataprocessing system, to a computer program product adapted to perform themethod in accordance with the invention, and to a data processingsystem.

BACKGROUND

The extensible markup language (XML) is a W3C-recommended generalpurpose markup language that supports a wide variety of applications.XML languages or ‘dialects’ are easy to design and to process. They arealso reasonably human-legible, and to this end, terseness was notconsidered essential in its structure. XML is a simplified subset ofstandard generalized markup language (SGML). Its primary purpose is tofacilitate the sharing of data across different information systems,particularly systems connected via the (public) internet.

XML provides text based means to describe and apply a tree basedstructure to information. At its base level, all information manifestsas text, interspersed with markup that indicates the information'sseparation into a hierarchy of character data, container like elements,and attributes of those elements.

SAX is a serial access parser API for XML. API is an acronym forapplication programming interface which is a source code interface thata computer system or program library provides in order to supportrequests for services to be made of it by a computer program. SAXprovides a mechanism for reading data from an XML document. A parserwhich implements SAX handles XML information as a single stream of data.This data stream is unidirectional, such that previously accessed datacannot be re-read without re-parsing. SAX is fast and efficient toimplement, but difficult to use for extracting information at randomfrom the XML document, since it tends to burden the application authorwith keeping track of what part of the document is being processed. Itis better suited to situations in which certain types of information arealways handled the same way, no matter where they occur in the document.

DOM is another interface oriented API that allows for navigation of anentire XML document as if it were a tree of ‘node’ objects representingthe document's contents. A DOM document can be created by a parser froman XML document or can be generated manually by users. Data types in DOMnodes are abstract and implementations provide their own programminglanguage specific bindings. DOM implementations tend to be memoryintensive, as they generally require the entire document to be loadedinto memory and constructed as a tree of objects before access isallowed.

RAX which is an acronym for random access XML relates to a further APIfor processing XML data. RAX can be understood as a new parsing modelwhich is neither DOM nor SAX, but a machine-based, micro-based parallelprocessing strategy implemented in purpose built XML hardware. The coretechnology employed in RAX is an XPath engine. XPath is a language foraddressing elements in an XML document. It can be understood as an XMLdocument query language for making assertions about the content in XMLdocuments. RAX processes an XML document by generating XPath expressionsfrom the XML document, such that there is one XPath expressioncorresponding to each item in the XML document, whereby each XPathexpression selects only the corresponding XPath item. The XPathexpressions may be generated with a script. When an item is to beselected from the XML document, the XPath expressions are evaluatedsimultaneously in order to select the item from the generated XPathexpressions. Each XPath expression produces a truth value indicatingwhether it is matched or not. The uniqueness constraint guarantees thatone and only one XPath will have the truth value “TRUE”. The position ofthe matching XPaths provides an index to the corresponding item in theXML document or in a derived list which represents the XML document.

The processing of XML documents is done by the data processing systemwhich receives the XML document, for example via a network from aserver. Such a data processing system is also referred to as applicationsystem as an application executed by the data processing system furtherprocesses the XML document. In order to be able to process large XMLdocuments, the data processing system must however provide sufficientresources which might not always be the case. Furthermore, due to theincreased size of XML documents for example in comparison with raw data,XML based communication requires more network bandwidth and networkprocessing when transferring the XML document from the server to thereceiving data processing system.

The problem of low XML parsing performance is addressed by trying toimplement faster parsers. However, the possibilities for furtherimprovements of these parsers are limited because XML parsing is verysequential by nature and inherently involves a large processingoverhead. Since even with fast parsers, the parsing is still done onapplication systems, these systems are relieved only marginally. It isan object of the invention to describe an improved method of providingXML data to the data processing system corresponding to the applicationsystem which makes use of the XML data. It is a further object of theinvention to provide an improved data processing system.

SUMMARY OF THE INVENTION

According to a first aspect of the invention, there is provided a methodof providing XML data to a first data processing system. In accordancewith an embodiment of the invention, the method is performed by a seconddata processing system which is connected to the first data processingsystem via a network. According to a step of the method, the second dataprocessing system receives a first request over the network from thefirst data processing system. The first request comprises specificationsfor subsequent data transfers of XML data, wherein the specificationsspecify for which type of XML documents which excerpts of XML data fromthe XML documents shall be sent to the first data processing system. Ina further step, the second data processing system sends an acknowledgemessage to the first data processing system. The second data processingsystem indicates via the acknowledge message its ability to provide theexcerpts of XML data of the types of XML documents in the subsequentdata transfers.

The first data processing system can for example be a client systemwhich is served by the second data processing system over the networkvia a publish/subscribe (pub/sub) service. The first and second dataprocessing systems can therefore be parts of a publish/subscribe system,wherein the first data processing system corresponds to a publisherwhich posts requests to the second data processing system which can beseen as a broker that provides responses to the first data processingsystem with respect to the requests.

The first and second data processing systems set up further transfers ofXML data by use of the first request which is acknowledged via theacknowledge message. In the first request, the first data processingsystem specifies which excerpts of XML data it wants to receive when itasks via subsequent requests for the corresponding types of XMLdocuments. The term ‘type of XML documents’ relates to a class of XMLdocuments which can be for example specified according to the type ofthe document type definition (DTD). The first request is therefore usedby the first data processing system to set up the second data processingsystem, which can be regarded as a broker, with respect to whichexcerpts of XML data shall be sent when the first data processing systemrequests via further communication steps for a specific type of XMLdocuments.

In accordance with an embodiment of the invention, the second dataprocessing system receives a second request from the first dataprocessing system. The first data processing system requests via thesecond request for a type of XML documents from the second dataprocessing system. In a further step, the second data processing systemselects the excerpts of XML data from the XML documents having thecorresponding type which shall be sent with respect to the type of XMLdocuments according to the specifications. In a further step, theexcerpts of XML data are sent over the network to the first dataprocessing system. Thus, instead of providing the complete XML documentsof the type requested for by the first data processing system, thesecond data processing system selects the excerpts of XML data that arespecified for the type of XML documents from the corresponding XMLdocuments. The method in accordance with the invention is thereforeparticularly advantageous as the first data processing system onlyreceives the excerpts of XML data and must therefore only process thereceived excerpts of XML data and not the full XML documents. Thus, thefirst data processing system has to employ fewer resources forprocessing the received XML data. Further, fewer network resources haveto be employed for the transfer of the excerpts of XML data incomparison with the network resources that must be employed to transferthe complete XML documents via the network from the second dataprocessing system to the first data processing system.

In accordance with an embodiment of the invention, the type of XMLdocuments relates to an XML document. The first and/or second requestcomprises a document identifier for the XML document. The documentidentifier is employed by the second data processing system to identifythe XML document from which the excerpts of XML data are to beextracted. The document identifier can for example relate to the name ofthe XML document or to the URL (universal resource locator) under whichthe XML document can be retrieved.

In accordance with an embodiment of the invention, the excerpts of XMLdata are sent in one or more data packets. The last data packet of theone or more data packets comprises an end of stream information in orderto indicate to the first data processing system that the last datapacket is the latest data packet sent by the second data processingsystem and that no further data packet will be sent by the second dataprocessing system with respect to the delivery of the excerpts of XMLdata requested via the second request.

In accordance with an embodiment of the invention, the specificationscomprise binary format identifiers, wherein a binary format identifierspecifies the format in which an excerpt of XML data associated with aspecific type of XML documents shall be sent by the second dataprocessing system. The first data processing system specifies thereforein the first request the binary format in which the first dataprocessing system expects to receive the excerpts of XML data associatedwith a specific type of XML documents if it requests for these excerptsof XML data in requests such as for example the above mentioned secondrequest. Formats that can for example be specified by a binary formatidentifier are: UTF-8 encoded strings, UTF-16 encoded strings, signed32-bit integer, signed 64-bit integer, unsigned 32-bit integer, unsigned64-bit integer, single precision IEEE-compliant floating point number,and double precision IEEE-compliant floating point number.

In accordance with an embodiment of the invention, the excerpts of XMLdata are returned in a predefined binary format. This provides theadvantage that no binary format identifiers have to be specified in therequest.

In accordance with an embodiment of the invention, the second requestfurther comprises a characterization for each excerpt of XML data to beextracted by the second data processing system from one or more XMLdocuments. A characterization might for example relate to an elementname of the corresponding excerpt of XML data that shall be selectedfrom one or more XML documents. A characterization might also relate toan XPath expression which can be used to extract the correspondingexcerpt of XML data from an XML document.

In accordance with an embodiment of the invention, the second requestcomprises a filter list, wherein the filter list comprises a set offilter pairs, wherein each filter pair of the set of filter pairsprovides a characterization for an excerpt of XML data and a binaryformat identifier for the excerpt of XML data. The characterization canbe used by the second data processing system to select the correspondingexcerpt of XML data from an XML document. As mentioned above, thecharacterization can for example be an element name specifying theexcerpt of XML data or an XPath expression. The binary format identifierspecifies the format in which the first data processing system wishes toreceive the excerpt of XML data from the second data processing system.

In accordance with an embodiment of the invention, the documents of thecorresponding type requested for via the second request are available tothe second data processing system in an internal data representation.The excerpts of XML data requested with respect to the type of XMLdocuments are generated from the internal data representation of thedocuments. The internal data representation can for example be generatedfrom a specific XML document by applying a SAX API. The internal datarepresentation of this XML document might be stored in the memory whichis accessible by a processor of the second data processing system. Theprocessor can then access the internal data representation and generatethe requested excerpts of XML data. The generation of requested excerptsof XML data directly from the internal data representation has theadvantage that the excerpts of XML data can be extracted very rapidlyfrom the internal data representation and hence, the time required todeliver the requested excerpts of XML data to the first data processingsystem is reduced in comparison with the generation of the excerpts ofXML data from the XML documents.

In accordance with an embodiment of the invention, the documents of thetype requested for via the second request relate to one or more XMLdocuments stored on the second data processing system. The second dataprocessing system parses then the one or more XML documents in order toextract the excerpts of XML data which shall be sent with respect to thecorresponding type of XML documents as set up via the first request. TheXML documents can for example be stored on a storage medium of thesecond data processing system such that a processor running a SAX or DOMparser is able to parse the XML documents and select the specifiedexcerpts of XML data.

According to a second aspect of the invention, there is provided amethod for providing XML data to a first data processing system. Inaccordance with an embodiment of the invention, the method is performedby the first data processing system which is connected to a second dataprocessing system via a network. According to a step of the method, thefirst data processing system sends a first request over the network tothe second data processing system. The first request comprisesspecifications for subsequent data transfers of excerpts of XML data oftypes of XML documents. The specifications specify for which type of XMLdocuments which excerpts of XML data shall be sent to the first dataprocessing system. According to a further step of the method, the firstdata processing system receives an acknowledge message from the seconddata processing system. The second data processing system indicates viathe acknowledge message its ability to provide the excerpts of XML datafor the types of XML documents in subsequent data transfers.

According to a third aspect of the invention, there is provided acomputer program product with computer executable instructions, whereinthe instructions are adapted to perform one of the methods in accordancewith the invention.

In accordance with an embodiment of the invention, the computer programproduct comprises a computer useable medium including a computerreadable program, wherein the computer readable program when executed ona computer causes the computer to:

-   -   receive a first request over the network from the first data        processing system, the first request comprising specifications        for subsequent transfers of XML data from the second data        processing system to the first data processing system, wherein        the specifications specify for which type of XML documents which        excerpts of XML data shall be sent to the first data processing        system, and    -   send an acknowledge message to the first data processing system,        the second data processing system indicating via the acknowledge        message its ability to provide the excerpts of XML data of the        types of XML documents in the subsequent data transfers.

According to a fourth aspect of the invention, there is provided a dataprocessing system, the data processing system relates to a second dataprocessing system in a network which also comprises a first dataprocessing system. The second data processing system comprises means forreceiving a first request over the network from the first dataprocessing system, wherein the first request comprises specificationsfor subsequent data transfers of XML data, wherein the specificationsspecify for which type of XML documents which excerpts of XML data shallbe sent to the first data processing system. The second data processingsystem also has means for sending an acknowledge message to the firstdata processing system, wherein the second data processing systemindicates via the acknowledge message its ability to provide theexcerpts of XML data for the types of XML documents in the subsequentdata transfers.

According to a fifth aspect of the invention, there is provided a dataprocessing system. The data processing system relates to a first dataprocessing system in a network comprising the first data processingsystem and a second data processing system. The first data processingsystem has means for sending a first request over the network to thesecond data processing system. The first request comprisesspecifications for subsequent data transfers of XML data, wherein thespecifications specify for which type of XML documents which excerpts ofXML data shall be sent to the first data processing system. The firstdata processing system further has means to receive an acknowledgemessage from the second data processing system. The acknowledge messageis used by the second data processing system to indicate the ability toprovide the excerpts of the XML data for the types of XML documents inthe subsequent data transfers to the first data processing system.

According to a sixth aspect of the invention, there is provided a proxysystem in a network that also contains at least a first data processingsystem according to embodiments of the invention and another dataprocessing system which might be a second data processing systemaccording to embodiments of the invention. In accordance with anembodiment of the invention, the proxy system is adapted to comprise thefunctionalities of the first and/or second data processing systems inaccordance with the invention. In accordance with an embodiment of theinvention, the proxy system is adapted to perform steps of the methodsin accordance with the invention.

As described above, the first data processing system is adapted to sendthe first request to another data processing system. The first requestis used to specify which excerpts of XML data the first data processingsystem expects to receive when it requests via subsequent requests forthe corresponding types of XML documents. The proxy system is adapted tointercept this first request and to determine whether the other dataprocessing system is a second data processing system in accordance withthe invention. If this is not the case, the proxy system takes the roleof the second data processing system and sends an acknowledge message tothe first data processing system. The proxy system is adapted tointercept a subsequent second request from the first data processingsystem to the other data processing system, to retrieve the XMLdocuments requested for in the second request, to parse those XMLdocuments, for example using a SAX API, and to send the excerpts of XMLdata as specified in the first request back to the first data processingsystem.

If the other data processing system is a second data processing, theproxy system sends the first request to this second data processingsystem, and will intercept and forward subsequent second requests fromthe first data processing system to this second data processing system.

The proxy system is particularly advantageous because the provision ofthe excerpts of XML data by the other (second) data processing system istransparent to the first data processing system. The first dataprocessing system can therefore always set up a transfer of excerpts ofXML data in accordance with the methods in accordance with the inventionif at least a proxy system is comprised in the network and locatedbetween the first data processing system and other data processingsystems since the proxy system is adapted to act as second dataprocessing system in accordance with the invention.

BRIEF DESCRIPTION OF THE DRAWINGS

In the following embodiments of the invention are exemplary described ingreater detail by making reference to the drawings in which:

FIG. 1 is a schematic block diagram of a network comprising a first dataprocessing system and a second data processing system;

FIG. 2 is a flow diagram illustrating steps of a method in accordancewith the invention;

FIG. 3 shows a flow diagram illustrating steps of a method in accordancewith the invention;

FIG. 4 shows a schematic block diagram of a network comprising a firstdata processing system, another data processing system, and a proxysystem;

FIG. 5 shows a schematic block diagram of a network comprising a firstdata processing system, another data processing system, and a proxysystem; and

FIG. 6 shows a schematic block diagram of a network comprising multipledata processing systems and proxy systems.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 1 shows a schematic block diagram of a network 100 comprising afirst data processing system 102 and a second data processing system104. The first data processing system 102 comprises a microprocessor106, and a storage 108. The second data processing system 104 comprisesa microprocessor 110, and a storage 112.

The microprocessor 106 of the first data processing system 102 executesa computer program product 114 which might for example be permanentlystored on the storage 108 and which is loaded for execution into themicroprocessor 106. Similarly, the microprocessor 110 of the second dataprocessing system 104 executes a computer program product 116 which isloaded from the storage 112 into the microprocessor and which ispermanently stored on the storage 112.

The first data processing system 102 and the second data processingsystem 104 are connected over the network connection 118.

The first data processing system 102 can be regarded as a client systemwhich is served by the second data processing system 104 over thenetwork 100 via a pub/sub service. The second data processing system 104might therefore hold XML documents, such as XML document 120 and XMLdocument 121 on the storage 112. Each of the XML documents, such as theXML documents 120 and 121, relates to an XML document type, and hence toa class of XML documents.

The first data processing system 102 might want to receive datacomprised in one of the XML documents held on the second data processingsystem 104 for further usage. In order to set up a communication betweenthe first data processing system 102 and the second data processingsystem 104 through which the first data processing system 102 canrequest XML data from the second data processing system, the computerprogram product 114 generates a first request 122. The first request 122comprises specifications 124.

The specifications 124 specify a first type of XML documents 126, andfurther provide characterizations for excerpts of XML data 128 thatspecify which excerpts of XML data are to be extracted and sent from theXML documents of the first type 126. The specifications 124 furthercomprise a first binary format identifier 129 which specifies the binaryformat in which the excerpts of XML data are to be sent.

Further, the specifications 124 specify a second type of XML documents130, and further provide characterizations for excerpts of XML data 132,and a second binary format identifier 133. The characterizations for theexcerpts of XML data 132 indicate which excerpts of XML data of XMLdocuments of the second type 130, e.g., XML document 121, are to be sentto the first data processing system 102 in the binary format specifiedby the second binary format identifier 133. The first request 122 isthen sent over the network connection 118 to the second data processingsystem 104.

The second data processing system 104 receives the first request 122.The computer program product 116 might then generate an acknowledgemessage 134 which is sent over the connection 118 to the first dataprocessing system in order to indicate the ability to provide theexcerpts of XML data according to the specifications 124 for the typesof XML documents 126 and 130, respectively, in subsequent datatransfers.

The computer program product 114 of the first data processing system 102generates then a second request 136. The second request 136 is used torequest for XML documents of the first type 126. The second request 136is sent over the network connection 118 to the second data processingsystem 104. The computer program product 116 analyzes the second request136 and determines the XML documents held on the storage 112 which areof the first type 126. It is assumed that the first XML document 120 isof the first type. The computer program product 116 selects then items138, 140, and 142 from the first XML document 120, wherein the items138, 140, and 142 correspond to the excerpts of XML documents that havebeen specified by use of the characterizations of excerpts of XML data128 in the previous first request 120 to be sent when it is requestedfor the first type of XML documents 126. The items 138, 140, and 142 arethen packed into data packets 144, 146, and 148. The data packets 144,146, and 148 are transmitted over the network connection 118 to thefirst data processing system 102, so that the first data processingsystem receives the requested excerpts of XML data corresponding to theitems 138 to 142.

FIG. 2 shows a flow diagram which illustrates steps of a method inaccordance with the invention. The method is performed by a second dataprocessing system in a network which further comprises the first dataprocessing system. According to step 200, a first request is receivedover the network from the first data processing system by the seconddata processing system. The first request comprises specifications forsubsequent data transfers of XML data, wherein those specificationsspecify for which type of XML documents which excerpts of XML data shallbe sent to the first data processing system. According to step 202, anacknowledge message is sent to the first data processing systemindicating the ability of the second data processing system to providethe excerpts of XML data for the types of XML documents in subsequentdata transfers.

FIG. 3 shows a flow diagram illustrating steps of a method in accordancewith the invention. The method is performed by a second data processingsystem which is connected via a network with a first data processingsystem. According to step 300, the second data processing systemreceives a second request from the first data processing system, whereinthe first data processing system requests for a specific type of XMLdocuments from the second data processing system. According to step 302,the second data processing system selects excerpts of XML data asspecified in a previous first request for the type of XML documents fromthe documents having the corresponding type. According to step 304, theexcerpts of XML data are sent over the network to the first dataprocessing system.

FIG. 4 shows a block diagram of a network 400 comprising a first dataprocessing system 402, another data processing system 404 which is asecond data processing system in accordance with the invention, and aproxy system 406. The other data processing system 404 can be regardedas a server that holds XML documents, such as XML document 408 on astorage media. The first data processing system 402 can be regarded as aclient which might want to process XML data contained in, e.g., the XMLdocument 408.

In order to set up a communication with the other data processing system404, the first data processing system sends a first request 410 to theother data processing system 404. The first request 410 comprisesspecifications for subsequent data transfers of XML data, wherein thespecifications specify for which type of XML documents requested for viaa subsequent second request which excerpts of XML data shall be sentfrom the other data processing system 404 to the first data processingsystem 402. The first request 410 is intercepted by the proxy system406.

The proxy system 406 comprises a microprocessor 412 and a storage 414.The microprocessor executes a computer program product 416 which isloaded from the storage 414 into the microprocessor 412. The storagefurther comprises a database 418. The computer program product 416 isused to determine, if the other data processing system 404 which is thedestination of the first request 410 is a second data processing systemin accordance with the invention and thus might be able to process thefirst request 410. For this, the computer program product accesses thedatabase 418 in which all data processing systems comprised in thenetwork 400 that are second data processing system in accordance withthe invention are listed. If the other data processing system 404 islisted in the database 418, the proxy system 406 further forwards thefirst request 410 to the second data processing system 404. The seconddata processing system 404 might then send an acknowledge message to thefirst data processing system 402.

In order to receive excerpts of XML data 420 from the XML document 408,the first data processing system 402 might further send a second request422 in which, for example, a type identifier 424 for the XML document408 is included. The second request 422 is also intercepted by the proxysystem 406 which forwards the second request 422 to the other (second)data processing system 404. The other (second) data processing system404 is able to identify the XML document 408 by use of the typeidentifier 424 and extracts the excerpts of XML data 420 from the XMLdocument 408. The excerpts of XML data 420 are then sent from the other(second) data processing system 404 to the first data processing system402.

FIG. 5 shows a block diagram of a network 500 comprising a first dataprocessing system 502, a proxy system 504, and another data processingsystem 506 which is not a second data processing system according to theinvention. The first data processing system 502 can be regarded as aclient who wishes to receive XML data from the other data processingsystem 506 which can be regarded as a server. For this, the first dataprocessing system 502 sends the first request 508 to the other dataprocessing system 506. The first request 508 comprises specificationsfor subsequent data transfers of XML data. The specifications specifyfor which type of XML documents requested for by the first dataprocessing system which excerpts of XML data shall be sent to the firstdata processing system.

The first request is intercepted by the proxy system 504. The proxysystem 504 is then able to determine, for example via a database aspointed out in the description to FIG. 4, that the other data processingsystem 506 is not able to process the first request 508, i.e., that theother data processing system 506 is not a second data processing systemin accordance with the invention.

The proxy system 504 therefore sends an acknowledge message 510 to thefirst data processing system 502 indicating its ability to provide theexcerpts of XML data for the types of XML documents as specified in thespecifications, when the first data processing system 502 requests forit.

When the first data processing system 502 sends a second request 512 bywhich the first data processing system requests for XML data relating toa type of XML documents, the second request 512 is also intercepted bythe proxy system 504. The proxy system 504 then requests the XMLdocuments relating to the type of XML documents as specified in thesecond request 512 from the other data processing system 506. The proxysystem 504 receives the XML documents from the other data processingsystem 506 and extracts the excerpts of XML data according to thespecifications given in the first request from the obtained XMLdocuments. The excerpts of XML data 514 are then sent by the proxysystem 504 to the first data processing system 502.

FIG. 6 shows a schematic block diagram of a network 600. The network 600comprises a data processing system 1 602, a proxy system 1 604, a proxysystem 2 606, a data processing system 2 608, a data processing system 3610, and a data processing system 4 612. The data processing system 1602 can be regarded as a data processing system that comprises thefunctionality of the above described first data processing system. Thedata processing system 4 612 can be regarded as a data processing systemthat comprises the functionality of the above described second dataprocessing system. The data processing system 2 608 can be regarded as adata processing system which is not adapted to implement the methods inaccordance with the invention as described before. The same holds forthe data processing system 3 610.

The proxy system 1 604 can be regarded as a data processing system thatimplements the functionality of the above described proxy system inaccordance with the invention. The same holds for the proxy system 2606.

The data processing system 1 602 might for example wish to receiveexcerpts of XML data from one or more XML documents from the dataprocessing system 2 608. In order to set up a subsequent transfer of theexcerpts of XML data, it therefore sends a first request to the dataprocessing system 2 608. The first request comprises specifications thatspecify for which type of XML document which excerpts of XML data shallbe sent to the data processing system 1 602. The proxy system 1 604intercepts the first request and forwards the first request to the dataprocessing system 2 608. The proxy system 1 604 might not be aware ofthe fact the data processing system 2 608 is not able to process thefirst request. The proxy system 1 604 therefore forwards the firstrequest to the data processing system 2 608 in order to see if itreceives an acknowledge message.

The first request is intercepted by the proxy system 2 606. The proxysystem 2 606 forwards the first request to the data processing system 2608 in order to find out if the data processing system 2 608 is able toprocess and handle the first request. As this is not the case, the proxysystem 2 606 does not receive an acknowledge message from the dataprocessing system 2 608.

The proxy system 2 606 therefore sends an acknowledge message inresponse to the reception of the first request to the proxy system 1604. The proxy system 1 604 sends an acknowledge message to the dataprocessing system 1 602. The data processing system 1 602 as well as theproxy system 1 604 therefore get the knowledge that further secondrequests in accordance with the invention will be served by the dataprocessing system 608 though in fact the proxy system 2 606 will servethese requests. This is however transparent to the systems 602 and 604.

The data processing system 1 602 further sends a second request asdescribed before to the data processing system 2 608. The proxy system 1604 intercepts this second request and forwards the second request tothe data processing system 2 608. The second request is thenintercepted, processed, and answered by the proxy system 2 606.

Similarly, when the data processing system 1 602 requests XML data fromthe data processing system 3 610, the proxy system 1 604 acts as seenfrom the data processing system 1 602 as second data processing systemin accordance with the invention and as seen from the data processingsystem 3 610 as first data processing system in accordance with theinvention. The use of proxy systems in the network therefore ensures theinteroperability between data processing systems in the network, whereinsome data processing systems are adapted to perform the methods inaccordance with the invention and wherein others are not.

A proxy system might not even be located on a separate node of thenetwork 600. For example, the functionality of the proxy system 1 604could be directly implemented into the data processing system 1 602,e.g., via a software component installed on this data processing system.

LIST OF REFERENCE NUMERALS

100 Network 102 First data processing system 104 Second data processingsystem 106 Microprocessor 108 Storage 110 Microprocessor 112 Storage 114Computer program product 116 Computer program product 118 Networkconnection 120 XML document 121 XML document 122 First request 124Specifications 126 First type of XML documents 128 Characterizations ofexcerpts of XML data 129 First binary format identifier 130 Second typeof XML documents 132 Characterizations of excerpts of XML data 133Second binary format identifier 134 Acknowledge message 136 Secondrequest 138 Item 140 Item 142 Item 144 Data packet 146 Data packet 148Data packet 400 Network 402 First data processing system 404 Other dataprocessing system 406 Proxy system 408 XML document 410 First request412 Microprocessor 414 Storage 416 Computer program product 418 Database420 Excerpts of XML data 422 Second request 424 Type identifier 500Network 502 First data processing system 504 Proxy system 506 Other dataprocessing system 508 First request 510 Acknowledge 512 Second request514 Excerpts of XML data 600 Network 602 Data processing system 604Proxy system 606 Proxy system 608 Data processing system 610 Dataprocessing system 612 Data processing system

1. A method of providing XML data to a first data processing system, themethod being performed by a second data processing system, the seconddata processing system being connected to the first data processingsystem via a network, the method comprising: receiving a first requestover the network from the first data processing system, the firstrequest comprising specifications for subsequent transfers of XML datafrom the second data processing system to the first data processingsystem, wherein the specifications specify for which type of XMLdocuments which excerpts of XML data shall be sent to the first dataprocessing system; receiving a second request from the first dataprocessing system, wherein the first data processing system requests toreceive excerpts of XML data with respect to a type of XML documentsfrom the second data processing system; selecting the excerpts of XMLdata according to the specifications from documents having the type;wherein the type of XML documents relates to an XML document, wherein atleast one of the first and second requests comprise a documentidentifier for the XML document, wherein the excerpts of XML data arecomprised in the XML document, wherein the XML document identifierenables the second data processing system to identify the XML document;sending the excerpts of XML data over the network to the first dataprocessing system; and sending an acknowledge message to the first dataprocessing system, the second data processing system indicating via theacknowledge message its ability to provide the excerpts of XML data ofthe types of XML documents in the subsequent data transfers.
 2. Themethod according to claim 1, wherein the excerpts of XML data are sentin at least one data packet, wherein the last data packet of the atleast one data packet comprises an end of stream information, whereinthe end of stream information indicates to the first data processingsystem that this is the last data packet sent by the second dataprocessing system.
 3. The method according to claim 1, wherein thespecifications comprise binary format identifiers, wherein a binaryformat identifier specifies the format in which an excerpt of XML dataassociated with a specific type of XML documents shall be sent by thesecond data processing system.
 4. The method according to claim 1,wherein documents of the type of XML documents requested for via thesecond request are available to the second data processing system in aninternal data representation, wherein the excerpts of XML data requestedwith respect to the type of XML data are generated from the internaldata representation of the documents.
 5. The method according to claim1, wherein the type of documents requested for via the second requestrelates to at least one XML document, wherein the at least one XMLdocument is parsed in order to extract the excerpts of XML datarequested with respect to the type of XML data.
 6. The method accordingto claim 1, wherein a proxy system is comprised in the network, whereinthe proxy system intercepts the first request sent from the first dataprocessing system to another data processing system, wherein the proxysystem determines whether the other data processing system is able toprocess the first request, wherein the first request is sent to theother data processing system from the proxy system if the other dataprocessing system is able to process the first request, and wherein theproxy system further forwards the second request to the other dataprocessing system.
 7. The method according to claim 6, wherein the proxysystem sends the acknowledge message to the first data processingsystem, if the other data processing system is not able to process thefirst request, and wherein the proxy system further processes the secondrequest by requesting the XML documents relating to the type of XMLdocuments requested for via the second request from the other dataprocessing system, by extracting the excerpts of XML data as specifiedin the first request from the XML documents, and by sending the excerptsof XML data to the first data processing system.
 8. A computer programproduct comprising computer executable instructions stored on anon-transitory computer-readable storage medium, the instructions beingadapted to perform a method of providing XML data to a first dataprocessing system, the method being performed by a second dataprocessing system, the second data processing system being connected tothe first data processing system via a network, the method comprising:receiving a first request over the network from the first dataprocessing system, the first request comprising specifications forsubsequent transfers of XML data from the second data processing systemto the first data processing system, wherein the specifications specifyfor which type of XML documents which excerpts of XML data shall be sentto the first data processing system; receiving a second request from thefirst data processing system, wherein the first data processing systemrequests a type of XML documents from the second data processing system;selecting the excerpts of XML data from the XML documents having thecorresponding type which shall be sent with respect to the type of XMLdocuments according to the specifications provided via the firstrequest; wherein the type of XML documents relates to an XML document,wherein at least one of the first and second requests comprise adocument identifier for the XML document, wherein the excerpts of XMLdata are comprised in the XML document, wherein the XML documentidentifier enables the second data processing system to identify the XMLdocument; sending the excerpts of XML data over the network to the firstdata processing system; and sending an acknowledge message to the firstdata processing system, the second data processing system indicating viathe acknowledge message its ability to provide the excerpts of XML dataof the types of XML documents in the subsequent data transfers.
 9. Adata processing system, wherein the data processing system includes asecond data processing system in a network, the second data processingsystem being connected to a first data processing system via thenetwork, the second data processing system comprising: a processor; amemory accessible by the processor, the memory including instructionsstored thereon, the instructions configured to implement the steps of:receiving a first request over the network from the first dataprocessing system, the first request comprising specifications forsubsequent transfers of XML data from the second data processing systemto the first data processing system, wherein the specifications specifyfor which type of XML documents which excerpts of XML data shall be sentto the first data processing system; receiving a second request from thefirst data processing system, wherein the first data processing systemrequests a type of XML documents from the second data processing system;selecting the excerpts of XML data from the XML documents having thecorresponding type which shall be sent with respect to the type of XMLdocuments according to the specifications provided via the firstrequest; wherein the type of XML documents relates to an XML document,wherein at least one of the first and second requests comprise adocument identifier for the XML document, wherein the excerpts of XMLdata are comprised in the XML document, wherein the XML documentidentifier enables the second data processing system to identify the XMLdocument; sending the excerpts of XML data over the network to the firstdata processing system; and sending an acknowledge message to the firstdata processing system, the second data processing system indicating viathe acknowledge message its ability to provide the excerpts of XML dataof the types of XML documents in the subsequent data transfers.
 10. Thedata processing system according to claim 9, the data processing systembeing adapted to send the excerpts of XML data in at least one datapacket, wherein the last data packet of the at least one data packetcomprises an end of stream information, wherein the end of streaminformation provides an indication to the first data processing systemthat no further data packet will be sent by the second data processingsystem.
 11. The data processing system according to claim 9, wherein thespecifications comprise binary format identifiers, wherein a binaryformat identifier specifies the format in which an excerpt of XML dataassociated with a specific type of XML documents shall be sent by thesecond data processing system.
 12. The data processing system accordingto claim 9, wherein documents of the type of XML documents requested forvia the second request are available to the second data processingsystem in an internal data representation, wherein the second dataprocessing system is adapted to generate the excerpts of XML datarequested from the internal data representation of the documents. 13.The data processing system according to claim 9, wherein the type ofdocuments requested for via the second request relates to at least oneXML document, wherein the second data processing system is adapted toparse the at least one XML document in order to extract the excerpts ofXML data requested with respect to the type of XML data.
 14. A dataprocessing system, wherein the data processing system includes a firstdata processing system in a network, the first data processing systembeing connected to a second data processing system via the network, thefirst data processing system comprising: a processor; a memoryaccessible by the processor, the memory including instructions storedthereon, the instructions configured to implement the steps of: sendinga first request over the network to the second data processing system,the first request comprising specifications for subsequent transfers ofXML data from the second data processing system to the first dataprocessing system, wherein the specifications specify for which type ofXML documents which excerpts of XML data shall be sent to the first dataprocessing system; sending a second request from the first dataprocessing system over the network to the second data processing system,wherein the first data processing system requests to receive excerpts ofXML data with respect to a type of XML documents from the second dataprocessing system; selecting the excerpts of XML data according to thespecifications from documents having the type; wherein the type of XMLdocuments relates to an XML document, wherein at least one of the firstand second requests comprise a document identifier for the XML document,wherein the excerpts of XML data are comprised in the XML document,wherein the XML document identifier enables the second data processingsystem to identify the XML document; sending the excerpts of XML dataover the network to the first data processing system; and receiving anacknowledge message from the second data processing system, the seconddata processing system indicating via the acknowledge message itsability to provide the excerpts of XML data of the types of XMLdocuments in the subsequent data transfers.
 15. The data processingsystem according to claim 9, wherein such data processing system is aproxy system.
 16. The data processing system according to claim 14,wherein such data processing system is a proxy system.